Cosmological constraints on Brans-Dicke theory 
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We report strong cosmological constraints on the Brans-Dicke (BD) theory of gravity using Cosmic 
Microwave Background data. We consider two types of models. First, the initial condition of 
the scalar field is fixed to give the same effective gravitational strength G e // today as the one 
measured on the Earth, Gn- In this case the BD parameter uj is constrained to uj > 177 at the 
95% confidence level, improving by a factor of two over previous constraints. In the second type 
the initial condition for the scalar is a free parameter leading to a somewhat stronger constrain of 
u> > 288 while G e ff is constrained to 0.97 < G e ///Gjv < 1.22 at the 95% confidence level. We argue 
that these constraints have greater validity than for the BD theory and are valid for any Horndeski 
theory which approximates BD on cosmological scales. In this sense, our constraints place strong 
limits on possible modifications of gravity that might explain cosmic acceleration. 



Introduction The Brans-Dicke theory of gravity 
(BDT), [3, Uj] is one of the simplest extensions of General 
Relativity (GR) depending on one additional parameter, 
uj. In addition to the metric, the gravitational field is 
further mediated by a scalar field 4> whose inverse plays 
the role of a spacetime varying gravitational strength. 

The importance of BDT lies beyond its level of simplic- 
ity, in that it is the limit of more sophisticated but also 
more realistic and physically motivated theories. Its im- 
mediate generalizations, the so called scalar-tensor the- 
ories 0, [j] , have had strong theoretical support from a 
variety of perspectives. For example, they manifest in the 
low-energy effective action for the dilaton-graviton sector 
in supergravity [||. More generally, in compactifications 
of theories with extra dimensions, for instance Kaluza- 
Klein (KK) type theories Q or the Dvali-Gabadadze- 
Porrati theory [7| , the extra-dimensional spacetime met- 
ric is decomposed in KK modes acting as effective scalars 
on our 4-dimensional spacetime in the same way that oc- 
curs in scalar-tensor theories. The BDT is a close cousin 
of the so called Galileon theories Q, recently proposed 
to explain cosmic acceleration while evading solar system 
constraints. In absence of matter fields, the scalar-tensor 
action also arises as a special sector of the Plebanski 
action when the trace component of the simplicity con- 
straints is relaxed 0, [l(| • Finally as we discuss further 
below, BDT arises as a particular limit of Horndeski the- 
ory [ll|, H2] , the most general scalar-tensor theory having 
2nd-order field equations in four dimensions. 

Solar system data put very strong constraints on the 
BD parameter uj. The measurement of the Parameter- 



ized Post- Newtonian parameter 7 (see [13|, [lj]) from the 
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Cassini mission gives uj > 40000 at the 2a level 
On cosmological scales, however, the story is somewhat 
different. Nagata et al. [16] report that uj > {50, 1000} 
at 4a and 2a respectively using Wilkinson Microwave 
Anisotropy Probe (WMAP) 1st year data. However, as 
argued in 17j], their 2a result is not reliable as the re- 
ported x 2 nas a sharp step form, and rather, one should 
take the 4a result as a more conservative estimate. Better 
constraints come from Acquaviva et al. [l7j] who report 



uj > {80, 120} at the 99% and 95% level respectively by 
using a combination of Cosmic Microwave Background 
(CMB) data from WMAP 1st year and a set of small scale 
experiments as well as large scale structure (LSS) data. 
Finally, Wu et al. [Uj] reported uj > 97.8 at the 95% level 
using a combination of CMB data from 5 years of WMAP 
and other smaller scale CMB experiments and LSS mea- 
surements from the Sloan Digital Sky Survey (SDSS) Re- 
lease 4 []j|. Their constraint is weaker than [I?]] even 
though newer data are used. As argued in [IH this is the 

—) rather than 
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use of fiat priors on ln(l 

Given that solar system data provide a far superior 
bound on uj, why constrain the BDT with cosmologi- 
cal data? There are two reasons why this is important. 
Firstly, cosmological constraints on BDT are important 
as they concern very different spatial and temporal scales. 
Secondly, as we discuss further below, BDT can be con- 
sidered as an approximation to a subset of Horndeski- 
type theories, and thus, cosmological constraints on BDT 
can be interpreted in a more general setting. 

The model The BDT is described by the action 



S = 
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where g is the metric determinant, R is the scalar curva- 
ture, A the cosmological constant, G is the bare gravita- 
tional constant and S m is the matter action. The matter 
action is independent of (f>, hence the weak equivalence 
principle is satisfied. We have chosen <f> to be dimension- 
less by convention. In the limit uj — ¥ 00 the theory tends 
to GR. 

The relevant equations to be solved may be found in Q 
and here we quote only the Friedman equation which is 

\ H+ 2l) = T P+ 4 (2 " + 3) {-$) (2) 

where H is the Hubble rate and p is the total matter 
density including A, and the background scalar equation 
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where P is the matter pressure including A. We only 
consider uj > — § since otherwise the scalar is a ghost. 
As it happens, the field stays constant during the radia- 
tion era because © is sourced by p — 3P resulting to 4> 
behaving like a massless scalar. As the Universe enters 
the matter era, however, (f> grows but only logarithmi- 
cally with the scale factor a. Thus, the scalar field today, 
4>o, is expected to be within a few percent of its initial 
value in the deep radiation era. 

If the scalar is approximately constant then the Fried- 
man equation becomes 3H 2 m 8irG e ffp where the effec- 
tive cosmological gravitational strength is given by £ = 
G e ff/G = l/(f>. For bound systems in the quasi-static 
regime, e.g. our solar system, the effective Newton's con- 
stant is GV = G(2oj + 3)/(2cj + 4), thus, observers in a 
bound system which formed today will measure the same 
cosmological and local gravitational strength if 
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We call such models restricted since to achieve the above 
condition the initial value of the scalar field, fa, must 
be appropriately fixed. Models for which fa is a free 
parameter will be called unrestricted. 

Analysis and methodology We numerically solved the 
FRW and linearized equations in the Jordan frame where 
the matter equations of motion are unchanged from their 
GR counterparts. To test the numerical results we imple- 
mented the synchronous gauge equations for scalar modes 
in a modified version of the CAMB package [20[ and com- 
pared with our own Boltzmann code (derived from CMB- 
Fast 2l| and DASh 22]) in which both the synchronous 
gauge and the conformal Newtonian gauge was used. 

We generated a chain of steps in parameter space 
by employmg tbe Markov-Chain Monte-Carlo (MCMC) 
method 
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using the Metropolis-Hastings algo- 
rithm |25j . The MCMC method guarantees that after an 
infinite number of steps the distribution of states visited 
by the algorithm is identical to the underlying probability 
distribution. Moreover, the time needed to sample a dis- 
tribution grows approximately linearly with the dimen- 
sion of the parameter space. For these reasons, the use of 
MCMC methods for parameter estimation in cosmology 
has become common practice and a variety of implemen- 
tations exist [26l - [29| | . In this work, we used the publicly 
available MCMC implementation CosmoMC pj ]. 

In practice, the chains obtained have finite length and 
one must test how closely they have converged to the 
underlying probability distribution. A variety of conver- 
gence tests are provided by CosmoMC. Our chains were 
long enough not only to pass the convergence diagnostics 
but also to give very accurate ID and 2D marginalized 
posterior distributions. 

We used CMB data from WMAP-7 0, South Pole 
Telescope (SPT) [3l[ and a compilation of other small 
scale CMB experiments, namely the Arcminute Cosmol- 
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FIG. 1: Left: The ID marginalized posterior of £ for 
WMAP7 + HST (dashed), with SPT (dotted) and all CMB 
(solid). Right: The ID marginalized posterior of logo) for 
the restricted (solid) and the unrestricted cases (dashed) for 
WMAP7 with/without SPT+HST data (thin/thick). 

ogy Barometer Receiver (32|, Balloon Observations Of 
Millimeter Extragalactic Radiation And Geophysics 33, 
34 1 and Cosmic Background Imager [35j . 

The chains were generated for two types of models, 
the restricted and unrestricted models. Restricted mod- 
els have 7 parameters which are the dimensionless baryon 
and dark matter densities ujb and uj c respectively, the ra- 
tio of the angular diameter distance to the sound horizon 
at recombination 9, the reionization redshift z re , the am- 
plitude and spectral index of the primordial power spec- 
trum A s and n s respectively and the BDT parameter 
uj. The (dimensionless) cosmological constant density uja 
and the Hubble constant Hq are derived parameters. Un- 
restricted models have one additional parameter which is 
the initial condition fa. When generating likelihoods for 
the SPT data, we also use the three parameters D^ , 
.D3000 and D^q 0Q which respectively normalize the Clus- 
ter, Poisson and Sunyaev-Zel'dovich templates [3lj ]. 

We now turn to the issue of priors. For the non-BDT 
parameters we assume the same priors as for ACDM since 
the two types of cosmological evolution are very similar. 
A prior on Hq from the measurement of the angular di- 
ameter distance at redshift z — 0.04 ]36] is also imposed 
for some chains. For the BDT parameters we impose flat 
priors on fa and on — ln{uS). The later also makes it more 
convenient to sample the chains, as was also done in [l7| . 
In [18| flat priors on y = ln(I + -M were used, in order 
to accommodate negative uj. However, since uj < — | is 
a ghost we see little reason for this. Furthermore, since 
dy ss — d - -j - , using y rather than \nuj penalizes models 
with large ui (explains the weaker constraints found in 
fljj ]) which we feel is rather artificial. 

Results and discussion Let us first present the results 
for the unrestricted models, for reasons to be discussed 
in more detail further below. Using WMAP7 alone we 
report u > {99.4,55.4} and £ = {0.95^:^,0.95^;^} 
at the 95% and 99% level respectively, while if the HST 
prior is assumed this is changed to uj > {125.6,61.9} 
and £ = {1.07T.o!24; l-O^iaS} at tne same corresponding 
levels. Including SPT data along with WMAP7 and HST 
prior strengthens the constraints to uj > {269.3, 147.6} 
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and £ = {l.lOI^t ^-o.w} while further including the 
"high-f data compilation improves to lu > {288.2,152.3} 
and ( = {lmtoil, i-O^ai?}- Constraints on G from 
CMB can be found in [37j where 0.74 < £ < 1.66 is found 
from WMAP-1 alone at the 95% of confidence level. Our 
results improve on (37j and further put them in context 
with a realistic theory. 

Clearly, including SPT data greatly improves the con- 
straints from just using WMAP. This is mainly due to 
a better determination of 9 by constraining the location 
of more CMB peaks but also a better determination of 
the damping envelope and of the peak heights. As dis- 
cussed in [38l ] . the main effect of increasing £ is to in- 
crease the width of the visibility function which in turn 
increases photon diffusion and damps the CMB temper- 
ature anisotropics on small scales. Thus the main con- 
straints on £ from CMB temperature come from con- 
straining the damping envelope with SPT data. Increas- 
ing £ has a slightly different effect on polarization. The 
same damping effect occurs on small scales but on large 
scales we get an enhancement as a thicker last scattering 
surface increases the amplitude of the local quadrupole 
which produces larger polarization signal [381 ] . 

The effect of varying lu is different from varying £. 
Varying lu results to a change in the background expan- 
sion history H which results in a shift of the peak loca- 
tions and peak heights. It makes more sense, however, 
to consider the changes in the spectrum at fixed 9 as we 
use it as a parameter. In that case, the dominant effect 
is on large scales due to the Integrated Sachs- Wolfe effect 
while small scales are affected due to the perturbations in 
<p and are of lesser importance. Thus, the improvement 
to the constraints on lu coming from SPT data is indirect 
and is due to reducing the allowed region by constraining 
£ as well as the other parameters. 

Let us now turn to the restricted class of models. Us- 
ing WMAP7 alone we report u > {89.7, 51.0} at the 95% 
and 99% level respectively, while if the HST prior is as- 
sumed we improve to uj > {177.3, 120.0} at the same 
corresponding confidence levels. Including SPT data, 
does not improve the bounds as one would expect but 
rather we find lu > {156.5,114.1}. although using the 
"high-f data compilation makes things slightly better 
to u> > {165.7, 122.8}. We see a significant improvement 
over [17| , driven mainly by the inclusion of polarization 
data which break the degeneracy between z re and n a and 
allow the measurements of the damping tail to improve 
the determination of the other parameters and limit the 
freedom of lu to vary. 

Why do constraints on the restricted class not improve 
with including more data but rather become less con- 



strained than the unrestricted class which has one more 
parameter? In fact, our WMAP+HST results are com- 
patible with the forecasting of [39] but given that [39I ] 
expect lu to be constrained close to ~ 1000 with Planck, 
it is reasonable to expect values around 500 — 700 with 
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FIG. 2: The 2D marginalized posterior distributions for the 
In a; — uia (left) and lncj — £ (right) planes at the 95% confidence 
level using WMAP7 + SPT +HST data. On the left, the 
wider set of contours are for the restricted model, the contours 
in the middle are for the unrestricted model and the front set 
for a fictitious restricted case where £ today was fixed to be 
the best fit from the unrestricted case. 

the inclusion of SPT data, contrary to what we find in 
practice. The reason we see no improvement with the 
inclusion of SPT data is uncovered on the right panel of 
figure [TJ The marginalized distribution of In lu exhibits 
a peak around lu ~ 400, however, the difference in like- 
lihood between the GR limit and this peak is very small 
which renders this "detection" insignificant. However, its 
presence makes it difficult to improve the lower bound of 
lu unless a different data set is included. 

To understand this better recall that in the unre- 
stricted case the preferred value of £ deviates from unity 
at around 10%. However, as in the restricted model 
£ w 1, this discrepancy in £ between the two types of 
models cuts-off a large portion of good likelihood in pa- 
rameter space. This is shown on the right panel of figure 
[5] where for values of £ < 1 there is a small degeneracy 
between lncj and £. In order to verify this assertion we 
consider a fictitious restricted model where 4>i is fixed so 
that £ is the best fit from the unrestricted model. As ex- 
pected, the best-fit sample for this fictitious model lies in 
the ACDM large lu limit and the constraint on omega be- 
comes lu > 471 at the 95% level which is notably stronger 
than both the realistic models and also in line with the 
forecasting of [3{| . A degeneracy also exists between In lu 
and ua as shown on the left panel of figure [2J which is 
lifted in the fictitious model. This suggests that improved 
constraints on BDT can be obtained by including a dif- 
ferent data set to CMB which can determine lu\ more 
precisely. 
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Implications for modified gravity Regardless the sim- 
plicity of this model, our constraints are more gener- 
ally valid as the BDT can be considered an approxima- 
tion to the most general scalar-tensor theory [ll], 12 1, 
above some length scale £*. The gravitational action is 
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J d 4 Xy/^ gJ2i=o where ip is by con- 



vention dimensionless and L^ 1 ' are given by 

L (2) = K (2) R + K (2) _ ^2] 

L (3) = -6K {3) G^VpV^V 

+k { x ) [(a/-) 3 - 3(D^) 2 O0 + 2(DV) 3 ] 

where X = -ig^V '^V ' v ij}, Dip = V ® and IfW 

are functions of X and t/>. We may expand as 

a Taylor series and the lowest order terms are 

-2A + 8wA + e^ 2 /£2 + £2 ^2 X 2 ; X (i) w e ^2 + e4£ 2 X ^ 

K<$ =ip 2 + e 5 ^ 4 + e 6 ilX and K^/il = e 7 tfj 2 + e s tlX. 
As £j — >■ and letting <f> — ip 2 we recover the BDT. 
We have ignored the constant terms in K^- 1 ' and 
as they lead to total derivatives. The constant term in 
cannot be ignored in general but would lead to GR 
coupled to a massless scalar as a — > and is irrelevant 
to our work. Thus our results hold for any Horndeski 
theory which can be approximated in the above form 
on cosmological scales. Choosing ~ l/16i?o (Hub- 
ble scale at recombination) and using the BDT solution 
as (j> — — and ~ 1 we find conservative estimates as 
ei < {10r 2 , 10 5 , 10, 10 5 , 1, 10 3 , 10 3 , 10 5 } in order for the 
additional terms to be smaller than the BDT term. 

Conclusions We found strong constraints on the BDT 
parameter u > 288 and effective gravitational strength 
£ = 1.09lg 'J 3 , at the 95% confidence level, significantly 
improving on previous work. Improvement on these 
bounds is expected through the inclusion of LSS and 
weak-lensing data which is left to future work. 
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